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April 7, 2000 

Ms. Jill Morningstar, Legislative Assistant 
U. S. Senator Paul Wellstone 
L36 Hart Senate Office Building 
Washington, DC 20510 

Dear Jill: 

We very much appreciate your seeking the endorsement of the American Psychological Association (APA) 
for the bill introduced by Senator Wellstone this week on Fairness and Accuracy in Student Testing. We 
wanted to get back to you quickly with an interim response about the Association's views. 

As we mentioned on the phone earlier this week, APA is not in a position to endorse the bill at this time. 
APA policy on this issue is guided by The Standards for Educational and Psychological Testing , the 1999 
version of which was developed over a six-year period with the American Educational Research 
Association and the National Council on Measurement in Education. While we are drafting a more 
comprehensive response for the Senator, we wanted to let you know the general areas of our concerns. 
First, we believe that the most fair and appropriate way to approach the problems Senator Wellstone seeks 
to address is to investigate high stakes decision-making in educational settings, how such decisions are 
informed, and the impact on educational outcomes. The critical issue is not tests, per se (provided they are 
valid, reliable instruments) but rather the instances wherein they and other measures of accountability (i.e. 
grades), are used inappropriately in making these decisions. Second, the bill appears to misstate the intent 
of both the Standards and the National Academy of Sciences (NAS) report by holding tests for tracking, 
retention and graduation to die same standard. The NAS study and the Standards treat those uses 
differently. Both documents treat graduation tests as certification tests, and do not prohibit assigning them 
determinative weight but offer recommendations for implementing them fairly. 

We understand the Senator wants to offer this amendment to the Elementary and Secondary Education Act 
soon and would appreciate APA’s assistance in doing so. Regrettably, APA has not yet adopted a policy 
that goes beyond the Standards to address the issues of how high stakes decision-making in schools affects 
education and educational outcomes more broadly, particularly for certain groups of children. 

Accordingly, it would be impossible for us to offer you language that the Association could endorse within 
your timeframe. However, we would welcome the opportunity to work with you on your Plan B: an 
amendment to authorize specific evaluation research that would provide additional needed information 
about the impact of high stakes decision-making on the educational opportunities of different populations 
of children, and on the educational system more generally. The NAS report and other sources point to 
significant gaps in the data. We hope to provide specific draft language to you late next week. 

Again, we appreciate your willingness to work with us. Senator Wellstone has been a champion of many 
of the issues most important to psychologists, and we are grateful to him. 


Sincerely, 




Ellen G. Garrison, Ph.D. 

Director of Public Interest Policy 


Patricia C. Kobor 
Director of Science Policy 
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S.L.C. 


106th CONGRESS 
2d Session 



IN THE SENATE OF THE UNITED STATES 

* * 


Mr. WELL3TONE introduced the following bill; which was read twice and re¬ 
ferred to the Committee on _ 


A BILL 

To provide for fairness and accuracy in student testing. 

1 Be it enacted by the Senate and House of Representa- 

2 fives of the United States of America in Congress assembled, 

3 SECTION 1. FAIRNESS AND ACCURACY IN STUDENT TEST- 

4 ING. 

5 (a) Findings. —Congress finds the following: 

6 (1) The use of large-scale achievement tests in 

7 education has grown significantly in recent years. 

8 States and local school districts have increasingly 

9 used these tests in such contexts as raising" student 

10 academic standards to make high-stakes decisions 

4 

11 with important consequences for individual students, 
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1 such as tracking (assigning students to schools, pro- 

2 grams, or classes based on achievement level), pro- 

3 motion of students to the next grade, and gradua- 

4 tion of students from secondary school. 

5 (2) The serious and often adverse consequences 

6 resulting from the sole or determinative reliance on 

7 large-scale tests have increasingly resulted in ques- 

8 tions and significant concerns by students, parents, 

9 teachers, and school administrators about how to en- 

10 sure that such tests are used appropriately and in 

11 a manner that is fair. 

12 (3) In 1997, Congress directed the National 

13 Academy of Sciences to “conduct a study and make 

14 * written recommendations on appropriate methods, 

15 practices, and safeguards to ensure that, among 

16 other things,... existing and new tests that are used 

17 to assess student performance are not used in a dis- 

18 criminatory manner or inappropriately for student 

19 promotion, tracking, or graduation.”. 

20 (4) In 1999, the National Academy of Sciences, 

21 through its National Research Council, completed its 

22 study and issued a report entitled ‘High Stakes: 

23 Testing for Tracking, Promotion and Graduation’. 

24 Guided by principles of measurement validity, attri- 

25 bution of cause, and effectiveness of treatment, the 




/ 


1/ 


S00@ 


XVJ HH1 OO/OC/CO 

AERA APA NCME 0031818 




(Case 11 mmvQOOGeS^-TlSC QarnuirraittmSB FRitetiH232M/IB FFfeEggpsSBacfflEB 


900 Eg) 


O:\KOS\KOSOQ.280 


S.L.C- 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


National Research Council made key findings for ap- 




propriate test use in educational settings, including 
the following: 

(A) When tests are used in ways that meet 
relevant psychometric, legal, and educational 
standards, students' scores provide important 
information, that combined with information 
from other sources, can lead to decisions that 
promote student learning and equality of oppor¬ 
tunity. 

(B) Tests are not perfect. Test questions 
are a sample of possible questions that could be 
asked in a given area. Moreover, a test score is 
not an exaet measure of a student’s knowledge 
or skills. 

(C) To the extent that all students are ex¬ 
pected to meet world-class standards, there is a 
need to provide world-class curricula and in¬ 
struction to all students. However, in most of 
the Nation, much needs to be done before a 


-class curriculum and world-class instruc¬ 
tion will be in place. At present, curriculum 
does not usually place sufficient emphasis on 
student understanding and application of con¬ 
cepts, as opposed to memorization and skill 
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mastery. In addition, instruction, in core sub¬ 
jects typically has been and remains highly 
stratified. What teachers teach and what stu¬ 
dents learn vary widely by track, with those in 
lower tracks receiving far less than a world- 
class curriculum. 

(D) Problems of test validity are greatest 
among young children, and there is a greater 
risk of error when such tests are employed to 
make significant decisions about children who 
are less than 8 years old or below grade 8, or 
about their schools. However, well-designed as¬ 
sessments may be useful in monitoring trends 
in the educational development of populations 
of students who have reached age 5. 

(5) The National Research Council made the 
following recommendations: 

(A) If parents, educators, public officials, 
and others who share responsibility for edu¬ 
cational outcomes are to discharge their respon¬ 
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sibility effectively, they should have access to 
information about the nature and interpretation 
of tests and test scores. Such information 
should be made available to the public and 
should be incorporated into teacher education 
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and into educational programs for principals, 
administrators, public officials, and others. 

(B) A test may appropriately be used to 
lead curricular reform, but it should not also be 
used to make high-stakes decisions about indi¬ 
vidual students until test users can show that 
the test measures what they have been taught. 

(C) High-stakes decisions such as tracking, 
promotion, and graduation should not automati¬ 
cally be made on the basis of a single test score 
but should be buttressed by other relevant in¬ 


formation about the student’s knowledge and 1 

i 

skill, such as grades, teacher recommendations, 
and extenuating circumstances. 

(D) In general, large-scale assessments 
should not be used to make high-stakes deci- ^ 
sions about students who are less than 8 years 

i 

old or enrolled below grade 3. 

(E) High-stakes testing programs should 
routinely include a well-designed evaluation 
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component. Policymakers should monitor both 
the intended and unintended consequences of 
high-stake assessments on all students and on 
significant subgroups of students, including nii- 
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1 norities, English-1 anguage learners, and stu- 

2 dents with disabilities. 

3 (6) These principles and findings of the Na- 

4 tional Academy of Sciences are supported in signifi- 

5 cant measure by the Standards for Educational and 

6 Psychological Testing, adopted and approved in De- 

7 cember of 1999, by the leading experts and profes- 

8 sional organizations on testing, including the Arrier- 

9 ican Educational Research Association, American 

10 Psychological Association, and the National Council 

11 on Measurement in Education. 

12 (b) Test Performance . —If performance on a 

13 standardized test is considered as part of any decision 

14 about the retention, graduation, tracking, or within-class 

4 

15 ability grouping of an individual student by a State edu- 

16 cational agency or local educational agency that receives 

17 funds under the Elementary and Secondary Education Act 

18 of 1965, such test performance shall not be the sole deter- 

19 minant in such decision and may be considered in making 

20 such decision only if— 

21 (1) the test meets professional standards of va- 

22 lidity ancl reliability for the purpose for which the 

23 test’s results are being used, including the validity 

24 and reliability of any cut score or performance 

25 standard set or established for use on the test; 
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(2) the test allows its users to make score inter¬ 
pretations in relation to a functional performance 
level, as distinguished from those interpretations 
that are made in relation to the performance of oth¬ 
ers, is based on State or local content and perform¬ 
ance standards, and is aligned with the curriculum 
and classroom instruction; 

(3) multiple measures of student achievement 
are utilized, including grades and evaluations by 
teachers, so that scores from large-sc ale assessm ents 
are never the only source of information used nor 


assigned determinative weight in making a high- 
stakes decision ab out an ind i vidual student; 

(4) students tested have been provided multiple 
opportunities to demonstrate proficiency in the sub¬ 
ject matter covered by the test’; 

(5) . the test is administered in accordance with 
the written guidance from the test developer or pub¬ 


lisher; 


(6) the State educational agency or local edu¬ 


cational agency has evidence that the test is of ade¬ 
quate technical quality fpr each purpose for which 

the test is used; 

* 

(7) the State educational agency or local edu- 

* 

cational agency provides appropriate aceommoda- 
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tions and alternate assessments for students with 

* 

disabilities that provide the students with a valid op¬ 
portunity to show what they know and can do; 

(8) the State educational agency or local edu¬ 
cational agency provides appropriate accommoda¬ 
tions for students with limited English proficiency, 
including— 

(A) if such a student is tested in English, 
the student received academic instruction pri¬ 
marily in English for at least 3 years prior to 
the test, or if the student received instruction in 
English for more than such 3 years, the local 
educational agency determines that the student 
has achieved sufficient English proficiency to 
ensure that the test will accurately measure the 
student’s subject matter knowledge and skills; 

(B) in the case of' students with limited 
English proficiency who have not been taught 
primarily in English for 3 years prior to the 
test, such students are assessed, to the greatest 
extent practicable, in the language and form 
most likely to yield accurate and reliable infor¬ 
mation about what those students know and 
can do; and 
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(C) in the case of Spanish-speaking stu- 

i 

dents with limited English proficiency, such 
students are assessed using tests developed and 
written in Spanish, if Spanish language tests 
are more likely than English language tests to 
yield accurate and reliable information on what 
those students know and can do; and 
(9) the test is not used for a decision about 
promotion or placement in special education for a 
child below the age of 8 or grade 3. 

(c) Evaluations.— 

(1) State educational agencies. —Each 






State educational agency that receives funds under 
the Elementary and Secondary Education Act of 
1965 and uses a standardized test as part of a high 
stakes decision described in subsection (b), shall con¬ 
duct a comprehensive evaluation of the impact of the 
test’s use on students' education and educational 
outcomes, with particular consideration given to the 
impact on individual students and subgroups of stu¬ 
dents disaggregated by socioeconomic status, race, 
ethnicity, limited English proficiency, disability, and 
gender. The State educational agency shall make the 


results of the evaluation available to the public and 
shall provide clear and comprehensible information 


STOP) 


TVA 8 7. : ft T flTT.T. n ri /ac/on 

AERA APA NCME 0031825 












(Case 11 amawaMSSE^-TTSi: tmrnuirmrttmSB fRitetiH232Mffi9 ff&gpellScrfflEB 


4 ' 


/ 


O:\KOS\KOS00.280 

1 

2 

3 

4 

5 

6 
7 

• 8 
9 
10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 

.. 21 . 

22 

23 

24 


S.L.C. 

10 


about the nature, use, and interpretation of the test 
and the scores the test generate. 

(2) Local educational agency.— Each local 
educational agency that receives funds under the El¬ 
ementary and Secondary Education Act of 1965, 
uses a standardized test as part of a high stakes de¬ 
cision described in subsection (b), and is located in 
a State that does not conduct an evaluation under 
paragraph (1), shall conduct a comprehensive eval¬ 
uation of the impact of the test’s use on students’ 
education and educational outcomes, with particular 
consideration given to the impact on individual stu¬ 
dents and subgroups of students disaggregated by 
socioeeonomic status, race, ethnicity, limited English 
proficiency, disability, and gender. The local edu- 

t 

cational agency shall make the results of the evalua¬ 
tion available to the public and shall provide clear 
and comprehensible information about the nature, 
use, and interpretation of the test and the scores the 
test generate. 


(3) Depabtment" of education'.— -The Sec-. 


retarv shall— 

(A) conduct an evaluation similar to the 
evaluation described in paragraph (1) among a 
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11 

1 representative sample of States and local edu- 

* 

2 cational agencies; 

3 (B) report the results of such evaluation to 

4 Congress; and 

5 (C) make the results of the evaluation 

6 available to the public. 

7 (d) Definition of Standardizes Test. —In this 

8 section the term “standardized test” means a test that 

9 is administered and scored under conditions uniform to 

10 all students so that the test scores are comparable across 

11 individuals; 
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